Articulated human pose estimation in natural images

نویسنده

  • Samuel Alan Johnson
چکیده

In this thesis the problem of estimating the 2-D articulated pose, or configuration of a person in unconstrained images such as consumer photographs is addressed. Contributions are split among three major chapters. In previous work the Pictorial Structure Model approach has proven particularly successful, and is appealing because of its moderate computational cost. However, the accuracy of resulting pose estimates has been limited by the use of simple representations of limb appearance. In this thesis strong discriminatively trained limb detectors combining gradient and colour segmentation cues are proposed. The approach improves significantly on the “iterative image parsing” method which was the state-of-the-art at the time, and shows significant promise for combination with other models of pose and appearance. In the second part of this thesis higher fidelity models of pose and appearance are proposed. The aim is to tackle extremely challenging properties of the human pose estimation task arising from variation in pose, anatomy, clothing, and imaging conditions. Current methods use simple models of body part appearance and plausible configurations due to limitations of available training data and constraints on computational expense. It is shown that such models severely limit accuracy. A new annotated database of challenging consumer images is introduced, an order of magnitude larger than currently available datasets. This larger amount of data allows partitioning of the pose space and the learning of multiple, clustered Pictorial Structure Models. A relative improvement in accuracy of over 50% is achieved compared to the standard, single model approach. In the final part of this thesis the clustered Pictorial Structure Model framework is extended to handle much larger quantities of training data. Furthermore it is shown how to utilise Amazon Mechanical Turk and a latent annotation update scheme to achieve high quality annotations at low cost. A significant increase in pose estimation accuracy is presented, while the computational expense of the framework is improved by a factor of 10.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Camera Pose Estimation in Unknown Environments using a Sequence of Wide-Baseline Monocular Images

In this paper, a feature-based technique for the camera pose estimation in a sequence of wide-baseline images has been proposed. Camera pose estimation is an important issue in many computer vision and robotics applications, such as, augmented reality and visual SLAM. The proposed method can track captured images taken by hand-held camera in room-sized workspaces with maximum scene depth of 3-4...

متن کامل

Combined discriminative and generative articulated pose and non-rigid shape estimation

Estimation of three-dimensional articulated human pose and motion from images is a central problem in computer vision. Much of the previous work has been limited by the use of crude generative models of humans represented as articulated collections of simple parts such as cylinders. Automatic initialization of such models has proved difficult and most approaches assume that the size and shape o...

متن کامل

3-d Hand Pose Estimation and Shape Model Refinement from a Monocular Image Sequence

This paper proposes a method to precisely estimate the shape and pose of articulated objects like a human hand. First, rough estimation is obtained using silhouette matching. Next, we apply the extended Kalman lter to tting a model to an image. However, because monocular images contain no depth information, ambiguity of the shape and pose cannot essentially be resolved for articulated objects. ...

متن کامل

Shape Models of the Human Body for Distributed Inference

of “Shape Models of the Human Body for Distributed Inference” by Silvia Zuffi, Ph.D., Brown University, May 2015 In this thesis we address the problem of building shape models of the human body, in 2D and 3D, which are realistic and efficient to use. We focus our efforts on the human body, which is highly articulated and has interesting shape variations, but the approaches we present here can b...

متن کامل

Action recognition feedback-based framework for human pose reconstruction from monocular images

A novel framework based on action recognition feedback for pose reconstruction of articulated human body from monocular images is proposed in this paper. The intrinsic ambiguity caused by perspective projection makes it difficult to accurately recover articulated poses from monocular images. To alleviate such ambiguity, we exploit the high-level motion knowledge as action recognition feedback t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012